Visualization of large data sets using MDS combined with LVQ

نویسنده

  • Antoine Naud
چکیده

A common task in data mining is the visualization of multivariate objects using various methods, allowing human observers to perceive subtle inter-relations in the dataset. Multidimensional scaling (MDS) is a well known technique used for this purpose, but it due to its computational complexity there are limitations on the number of objects that can be displayed. Combining MDS with a clustering method as Learning Vector Quantization allows to obtain displays of large databases that preserve both high accuracy of clustering methods and good visualization properties.

منابع مشابه

A new approach for data visualization problem

Data visualization is the process of transforming data, information, and knowledge into visual form, making use of humans’ natural visual capabilities which reveals relationships in data sets that are not evident from the raw data, by using mathematical techniques to reduce the number of dimensions in the data set while preserving the relevant inherent properties. In this paper, we formulated d...

متن کامل

Large-Scale Multidimensional Data Visualization: A Web Service for Data Mining

In this paper, we present an approach of the Web application (as a service) for data mining oriented to the multidimensional data visualization. The stress is put on visualization methods as a tool for the visual presentation of large-scale multidimensional data sets. The proposed implementation includes five visualization methods: MDS SMACOF algorithm, Relative MDS, Diagonal majorization algor...

متن کامل

New Developments of Nonlinear Projections for the Visualization of Structures in Nonvectorial Data Sets

Aalto University, P.O. Box 11000, FI-00076 Aalto www.aalto.fi Author Teuvo Kohonen Name of the publication New Developments of Nonlinear Projections for the Visualization of Structures in Nonvectorial Data Sets Publisher School of Science Unit Department of Information and Computer Science Series Aalto University publication series SCIENCE + TECHNOLOGY 8/2011 Field of research Computer science ...

متن کامل

Combined compression and classification with learning vector quantization

Combined compression and classification problems are becoming increasingly important in many applications with large amounts of sensory data and large sets of classes. These applications range from aided target recognition (ATR), to medical diagnosis, to speech recognition, to fault detection and identification in manufacturing systems. In this paper, we develop and analyze a learning vector qu...

متن کامل

Fast multidimensional scaling through sampling, springs and interpolation

The term ‘proximity data’ refers to data sets within which it is possible to assess the similarity of pairs of objects. Multidimensional scaling (MDS) is applied to such data and attempts to map high-dimensional objects onto low-dimensional space through the preservation of these similarity relationships. Standard MDS techniques have in the past suffered from high computational complexity and, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008